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© The invention provides methods to screen for inhibitors of proteases. The first method involves the 
construction of a ptasmtd system such that expression of a protease is toxic to a bacterial strain. Compounds of 
interest may then be screened for those which inhibit the toxic protease and thereby prevent killing of the 
bacterial strain. Inhibition is detected by successful transformations of the bacterial strain by the plasmid coding 
for the protease. The second method involves the construction of a fusion protein which contains a protease 
cleavage site inserted into a protein whose activity may be assayed. Compounds are screened for protease 
inhibition, which is detected by assaying for a change in color, light or activity of the protein. HIV protease and 
polio 3C protease are examples of proteases that may be used in these methods. 
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SCREENING METHODS FOR PROTEASE INHIBITORS 



BACKGROUND OF THE INVENTION 



Proteases are a class of proteins some of whose members have been implicated in human diseases. 

6 For example, human renin is an aspartyl protease which has been implicated in hypertension through its 
role in the formation in plasma of the vasoactive peptide angiotensin (Bibliography entry 1). The proteases 
trypsin and elastase are involved in the onset of emphysema (2,3). Of particular interest is the protease 
associated with the human immunodeficiency virus ("HIV). Polio protease is also of interest 

The primary translation products of HIV Include polyprotein precursors encoded by the gag and pol 

10 genes (4). The gag gene encodes the capsid proteins; the pol gene encodes the viral enzymes, comprising 
a 10 kd viral protease, a 66 kd reverse transcriptase and a 34 kd endonuclease (5). The production of these 
proteins is dependent upon the cleavage of these polyproteins by the viral protease, which is generated by 
autodigestion of the pol polyprotein. 

As a viral-specific enzyme, the protease is an attractive target for drugs for the inhibition of viral 

76 replication, and has become the focus of intensive investigation. The pol sequence (nucleotides 1833-2129) 
produces a mature 99 amino acid protease (6), with proline at the amino terminus and phenylalanine at the 
carboxy-terminus (7; Graves). The 99 amino acid protease is homologous to the aspartic protease family 
(8), and is inhibited by pepstatin A (9). Recombinant protease mutated at the catalytic Asp 25 residue is 
enzymatically inactive in bacterial expression systems and in vitro (5,6), and virus mutated at this site is 

20 non-infectious (6). The protease has recently been purified from E. coil and crystallized (10). 

At present there is an urgent need to develop therapeutic agenti against acquired immune deficiency 
syndrome ("AIDS"), which is believed to be caused by HIV. An agent which is selective for HIV protease 
might serve to inhibit HIV replication and thus be efficacious in the treatment of AIDS. There is thus a need 
for methods to screen for such Inhibitors of HIV protease. 

25 Existing methods for assaying HIV protease utilize polyprotein or oligopeptide substrates, and they 
require gel electrophoresis or high performance liquid chromatography to ascertain the presence of 
cleavage products (11-16). Other methods recently developed for multiple samples include resonance 
energy transfer (17) and release of radioactive products (18). These methods are not easily adapted for 
assaying protease activity in large numbers of samples. 

30 Accordingly, it is an object of this invention to develop screening methods for identifying inhibitors of 
proteases. 

It is a particular object of this invention to develop screening methods for identifying inhibitors of HIV 
protease. 

It is a further object of this invention to develop screening methods for identifying inhibitors of polio 
35 protease. 



SUMMARY OF THE INVENTION 

40 

The present invention provides methods for screening for compounds which inhibit protease enzymes. 
In one embodiment of the invention, the method is based first upon the construction of a plasmid having a 
DNA sequence which encodes a protease under the control of a T7 promoter. Next a bacterial expression 
strain is selected which is not transformed by the plasmid because of the toxicity of the protease to the 
45 bacterial strain. A compound to be screened for inhibitory activity is then added together with the plasmid to 
the bacterial strain. If the bacterial strain is not killed and produces transformants, the compound is 
identified .as an inhibitor of the protease. 

In particular, the first embodiment of the present invention provides plasmid constructs designated 
pPDB16 and pMae2 which contain a DNA sequence coding for HIV protease. These plasmids do not yield 
so transformants in an E. coli strain designated BL21(DE3). Thus, expression of HIV protease from the plasmid 
appears to be toxic tolhis E. coli strain. This discovery suggests a method for identifying agents which 
inhibit HIV protease. A similar plasmid construct, designated pPolio protease 3C, is described for polio 
protease, which may be used to screen for inhibitors of polio protease. 

In another aspect of the first embodiment of the invention, a method for screening for compounds which 
inhibit HIV protease is described. A plasmid is constructed having a DNA sequence which encodes HIV 
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protease under the control of a T7 promoter. Next, a bacterial strain is selected which is transformed by the 
plasmid and is then kilted by the induction of protease synthesis by an inducing agent A compound to be 
screened for inhibitory activity is then added together with the inducing agent to the transformed bacterial 
strain. If the induced cells are not killed, the compound is identified as an inhibitor of HfV protease. 

5 In the second embodiment of the invention, the method involves the selection of a protein which is 
capable of being assayed for a change in color, light or activity when combined with a protease enzyme in 
the presence or absence of a potential inhibitor compound. 

Next, a heterologous DNA sequence coding for a peptide which includes a protease enzyme cleavage 
site is inserted into the gene coding for the selected protein. The fused gene is inserted into a suitable 

10 expression system and is combined with the protease enzyme which recognizes and cleaves the peptide 
coded for by the heterologous DNA sequence embedded within the fusion protein, such that the protease 
enzyme cleaves at the protease enzyme cleavage site, which in turn inactivates the expressed fusion 
protein. 

This loss of activity is monitored by means of a color change, light change or by directly monitoring the 
is loss of activity. A compound to be screened is included in the assay system and is identified as an inhibitor 
of the protease enzyme if the protein is not cleaved and remains active when monitored by any of the 
methods described previously. 

In particular, the second embodiment of this invention provides a plasmid construct designated 
p1 +!QSau, which includes the gene for 0-galactostdase, into which is inserted a DNA sequence coding for 
20 a decapeptide HIV protease cleavage site. When expressed, this fusion protein retains 0-gaIactosidase 
activity. HIV protease cleaves and inactivates the fusion protein. 

Potentially inhibitory compounds of HIV protease are screened by monitoring 0-galactosidase activity, 
such that maintenance of 1-gaJactosidase activity indicates that the compound prevents cleavage and - 
inhibits HIV protease. 

29 A similar construct, designated plasmid pZM3CSau. includes the gene coding for 0-galactosidase into 
which is Inserted a DNA sequence coding for a polio 3C protease cleavage site. This construct is used to 
screen for compounds which inhibit polio 3C protease using procedures similar to those described 
previously for HIV protease. 

30 

BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 depicts the HIV protease constructs pPDB16 and pMae2. Construct pPDB16 encodes the 18 kd 

35 protease precursor under the control of the T7 promoter. The BgllhDral fragment from XBH5 (4), 
corresponding to nucleotides 1418-1899 of the HIV pol gene, is subcloned into the Ndei-BamHI sites of 
the previously described expression vector (19). using synthetic oligonucleotides. The upstream 
oligonucleotides begin with an Ndel restriction site, providing an in-frame translationai start codon, and 
regenerate nucleotides 1406-1417. The downstream oligonucleotides regenerate nucleotides 1900-1906, 

40 followed by two in-frame stop codons and a Bam HI restriction site. For clarity, base A of the initiation 
codon is defined as nucleotide (nt) 1. Amino acids +1 to +99 represent the mature 10 kd protease 
liberated by autodigestion at the indicated Phe-Pro bond (7). pMae2 encodes the mature 10 kd protease. 
Its construction is similar to PPDB16, except that the upstream oligonucleotides regenerate from 
nucleotide 1610 to the Maelll restriction site. 

45 Figure 2 depicts the expression of HIV protease in E. coli . Panel A: Protease synthesis in BL21(DE3>- 
pLysS cells harboring pPDB16. Proteins are labelled witiT 3s S methionine as described (19). followed by 
electrophoresis and autoradiography. Lane 1, no isopropyl-^-D-thiogalactopyranoside f IPTG"). Lane 2. 
1mM IPTG. The arrow denotes the mature 10 kd protease. Panel B: Protease synthesis in BL21(DE3) 
survivors harboring mutagenized pPDB16. The numbers above each lane correspond to the clone 

so designations in Tables II and III. Molecular weight markers are shown on the left (68 kd - bovine serum 
albumin; 43 kd -ovalbumin; 29 kd - carbonic anhydrase: 18 kd -beta-iactoglobulin; 14 kd - lysozyme). 
Rgure 3 depicts the killing of BL21 (DE3)pLysS upon induction of HIV protease synthesis with IPTG. 
Cells harboring plasmid encoding wild type (pPDBl6) or mutant (pP48) protease precursor are grown to 
0.5 absorbance units (600 nm), and IPTG is added to the indicated concentrations. At the indicated 

55 times, an aliquot of each culture is removed, and dilutions are plated on LB plates containing kanamycin 
and chloramphenicol. After overnight incubation, colony forming units ("CFU") per 100 ul of culture are 
determined. 

Rgure 4 depicts the comparison of amounts of wild-type and mutant protease production. BL21(DE3)- 
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pLysS cells harboring the indicated plasmids are induced with 1mM IPTG, and proteins are labelled with 
methionine as described (19). Whole cell extracts (lanes 1-5). and immunoprecipitatas (lanes 6-10) 
from antiserum raised to amino acids 34-46 of the mature protease, are subjected to electrophoresis and 
autoradiography. Lanes 1 and 6, pPDB16. Lanes 2 and 7, pP48. Lanes 3 and 8. pMae2. Lanes 4 and 9, 
5 pP48Mae6. Lanes 5 and 10, pPol, encoding the polio polymerase (19). Lane M, molecular weight 
markers (68 kd - bovine serum albumin; 43 kd - ovalbumin; 29 kd -carbonic anhydrase; 18 kd - beta- 
lactoglobulin; 1 4 kd -fysozyme; 6 kd * bovine trypsin inhibitor). 

Figure 5, Panel A depicts the HIV protease plasmids p1 +IQ and p15-IQ which encode wild type and 
mutant HIV protease, respectively, under tac promoter control. E. coll strain MC1061 harboring these 

io plasmids is used as a source of HIV protease for in vitro cleavage~of 0-galactosidase. For monitoring 
cleavage in vivo , the 0-galactosidase/cleavage cassette genes are excised from the pZM plasmid series 
with Stul and BamHI, and Ggated into the BamHI site of p1 +IQ or p15-IQ using BamHI tinkers, so that 
HIV protease and 0-galactosidase are co-expressed in the same E. coli cell. Panel B depicts the 
insertion of the HIV protease cleavage she into 0-gaiactosidase. Oligonucleotides encoding the HIV 

75 protease cleavage site are inserted into the indicated restriction sites of the 0-galactosidase gene. The 
nucleotide (nt) and amino acid (aa) positions with 0-galactosidase are indicated. 
Figure 6, Panel A depicts the in vitro cleavage of gag polyprotein by HIV protease. Gag polyprotein is 
incubated with buffer (lane 1), or with wild type (lane 2) or mutant (lane 3) HP/ protease, followed by 
electrophoresis and autoradiography. Molecular weight markers are shown on the left (67 kd -bovine 

20 serum albumin; 42 kd - ovalbumin; 28 kd -carbonic anhydrase; 18 kd • beta-lactoglobulin). Panel B 
depicts the in vitro cleavage of 0-galactosidase/cleavage cassette proteins by HIV protease. Extracts of 
MCI 061 cefii harboring HIV protease plasmid are incubated with extracts harboring l-galactosidase, 
followed by Western blotting with anti-0-gaiactostdase. Lane 1, MC1061 (no plasmid). Lane 2, wild type . 
HIV protease extract Lane 3, mutant HIV protease extract MC1061/pZMl extract (lane 4) is incubated 

25 with wild type (lane 5) or mutant (lane 6) HIV protease. MC1061/pZM-$au extract from two independent 
clones (lanes 7 and 10) is incubated with wild type (lanes 8 and 11) or mutant (lanes 9 and 12) HIV 
protease. The upper arrow on the right indicates ^3-gaJactosidase; the lower arrow on the right indicates 
the cleavage product from pZM-Sau. Molecular weight markers are shown on the left (103 kd - 
phosphorylase b; 67 kd - bovine serum albumin; 42 kd - ovalbumin; 28 kd - carbonic anhydrase). 

30 Figure 7 depicts the inhibition of cleavage of 0-galactosidase by Pepstatin A. The 0- 
galactosidase/cleavage cassette protein from MC1061 cells harboring pZM-Sau is incubated with HIV 
protease in the presence or absence of Pepstatin A, followed by Western blotting with anti-0-galac- 
tosidase. Lane 1, no protease. Lane 2, with protease added. Pepstatin A dissolved in dimethylsutfbxide is 
added to the incubation at concentrations of 0, 0.2, 1, 5. 25 and 50 ug/ml (lanes 3-8). The upper arrow 

35 on the right indicates 0-galactosidase; the lower arrow on the right indicates the cleavage product from 
pZM-Sau. The molecular weight markers shown on the left are identical to those in Figure 6, Panel B. 
Figure 8 depicts the in vivo cleavage of l-gatactosidasa/cteavage cassette proteins by HIV protease. 
Panel A: MC1061 cells harboring p1+ IQSau (lane 1), pl+IQ-CtaA (lane 2), pl+IQ-ClaB (lane 3), 
p1 +IQ-Bss (lane 4), pi + IQ-Eco (lane 5), p1 + IQ (lane 6), or without plasmid (lane 7), are grown to mid- 
40 log phase, and IPTQ is added to 1 mM. After 30 minutes, cells are pelleted and resuspended in sample 
buffer, followed by electrophoresis and Western analysis with anti-0-galactosidase. Panel B: MC1061 
cells without plasmid (lane 1) or harboring p1 +IQSau (lane 2) or p15-IQSau (lane 3) are treated as in 
panel A. In each panel, the upper arrow on the right indicates l-galactostdase; the lower arrow on the 
right indicates the cleavage product from p1 + IQSau. In each panel, the molecular weight markers are 

4$ identical to those in Figure 6, panel B. 

Figure 9 is a photograph which depicts the results of an in vivo cleavage assay. Samples of E. coli strain 
MC1081 harboring various plasmids are spotted onto plates containing the color indicator X-gal. The 
plasmids are pZM-1 (a positive control for 0-galactosidase activity), pZM-2 (a negative control for 0- 
galactosidase activity), pZM-Sau, p1 + IQSau (labelled 1+IQB-Sau) and p15-IQSau (labelled p15-IQB- 

50 Sau). In the cleavage assay with wild-type HIV protease, constructs which cleave l-galactosidase 
produce a white color, while constructs which do not cleave 0-galactosidase produce a blue color. 
Plasmid pZM-1 is constructed as described below. Plasmid pZM-2 encodes mutant, inactive 0- 
galactosidase. and so produces white colonies. Plasmid pZM-2 is constructed by digesting plasmid 
pZM-1 with EcoRI and BamHI to delete the DNA sequence coding for the 18 amino adds at the carboxy- 

55 terminal end of l-gaJactostdase. As expected, pZM-1 produces a blue color, as do pZMSau and p15- 
IQSau. Plasmid pZM-2, as a negative control, produces a white color; 0-galactosidase from p1 + IQSau is 
cleaved by HIV protease and produces a white color. 

Figure 10 depicts the cleavage of polio protease cleavage cassette by polio protease in vitro . Extracts of 
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0-galactosidase containing either the polio protease cleavage site (in MCl061/pZM3CSau) or the HIV 
protease cleavage site (in MCl081/pZM-Sau) are incubated with either polio or HIV protease, followed by 
Western analysis with anti-0-galactosidase. Lane 1, MC1081 without plasmid. Lane 2, polio 3C protease. 
Lane 3, HIV protease. Lane 4, MC1081/pZM3CSau unincubated. Lane 5. MC1061/pZM3CSau incubated 
with polio 3C protease. Lane 6, MC1061/pZM3CSau incubated with HIV protease. Lane 7. MC1061/pZM- 
Sau unincubated. Lane 8, MC1061/|pZM-Sau incubated with polio 3C protease. Lane 9. MCl061/pZM- 
Sau incubated with HIV protease. Lane 10. MC1061/pZM1 unincubated. Lane 11. MC1061/pZM1 
incubated with polio 3C protease. Lane 12. MC1061/pZM1 incubated with HIV protease. The upper arrow 
on the right indicates tf-galactosidase; the lower arrow on the right indicates the cleavage product from 
the Sau constructs. Molecular weight markers are shown on the left (103 kd - phosphorylase b; 67 kd - 
bovine serum albumin; 42 kd - ovalbumin. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed to two screening methods for inhibitors of proteases such HIV 
protease, renin, trypsin, elastase and poHo protease. Each screening method will first be described with 
respect to HIV protease and then for polio protease. The methods may also bo adapted for use with other 
proteases in accordance with the principles of the invention. 

The first screening method for inhibitors of HIV protease is directed to the construction of a plasmid 
which expresses HIV protease that is toxic to an E coli strain. This toxicity, in turn, is useful to screen for 

agents which inhibit HIV protease. 

To study the HIV protease, the Bgllt-Dral fragment from XBH5 (4), corresponding to nucleotides 1641- 
2122 of the HIV pol gene encoding the protease, is subcloned into a rjreviously-prepared expression vector 
(pT7K) containing~a phage T7 promoter (19). creating plasmid pPDB!6 (Fig. 1). The construction of 
PPD818 is carried out in E. coli strain DH5a (Bethesda Research Laboratories. Qaithersburg, Maryland), 
prior to transformation intoThe~xpression strain. E. coli BL21(DE3)pLysS. It is necessary to carry out this 
construction in a strain such as DH5a. which does not contain T7 RNA polymerase and. therefore, cannot 
express HIV protease, since expression of HIV protease will be shown to be toxic to the expression strain. 

An E coli strain DH5a harboring plasmid pPDB16 has been deposited with the American Type Culture 
Collect Und has been assigned ATCC accession number 68005. Plasmid pPDB16 may be obtained 
through conventional techniques (20) by growing this strain in culture, breaking open cells, centrifuging and 
isolating the plasmid from the supernatant 

The expression strain E. coli BL21(DE3)pLysS contains a chromosomal copy of T7 RNA polymerase 
under the control of an indudble lacUVS promoter, and a chloramphenicol-resistant plasmid encoding T7 
lysozyme which inhibits transcription by T7 RNA polymerase in the absence of induction (21.22). Cells 
harboring pPDB16 are induced with the inducing agent IPTG in the presence of »S methionine, and protein 
synthesis is examined by sodium dodecyl sulfate-poiyacrylamide gel electrophoresis ("SDS-PAGE") using 
a 15 percent polyacrylamide gel (comprising 15 ml of 30% acrylamide, 1.3 ml of 2% bisacrylamide. 11.2 ml 
of 1M TrisCI (pH 8.7), 0.15 ml 20% SDS. 2.35 ml of H2O, 25 ul N.N.N ,N .tetramethylethylenediamine and 
100 ul 10% ammonium persulfate (20)). and autoradiography. The HIV DNA fragment in pPDBie is 
expected to encode a primary translation product of 18 kd. Instead, as shown by SDS-PAGE. aprominent 
band of 10 kd is observed upon induction (Fig. 2A. lane 2). This observation is consistent with efficient 
autodigestion of the 18kd precursor into the mature 10 kd protease (the biologically active form) at the 
phenylalanine-proline bond (Fig. 1). and is in agreement with results reported previously (5.7). 

The 10 kd HIV protease band detected by autoradiography is not visible by Coomassie blue staining, 
suggesting that only low levels of this protein are made. It has been previously determined that the absolute 
level of expression of a cloned gene under T7 promoter control is lower in BL21(DE3)pLysS than in the 
corresponding strain lacking lysozyme (BL21(DE3) (21)), due to the inactivation of T7 RNA polymerase by 
lysozyme (22). However, genes under T7 promoter control which are toxic to Ecoh will not produce stable 
transformants in BL21 (DE3). which lacks the lysozyme plasmid. pLysS (21 ). . _ 

In an attempt to increase the level of expression of the HIV protease, the use of the expression strain 
BL21(DE3) is investigated by applicants. The results of two transformation experiments are shown in Table 
I: 
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TABLE I 





No. of transformants 


Ptasmid 


BL21(DE3) 


BL21(DE3)pLysS 


A. 


pPDB16 


0 


400 


pPoll 


400 


400 


pPol2 


350 


300 


B. 


pMae2 


0 


960 


p48Mae6 


330 


840 


pPoll 


600 


2400 



Table I. Transformation of 8L21(DE3) strains by HIV protease constructs. BL21(DE3) ceils containing or 
lacking the pLysS plasmid are made competent by the standard calcium chloride procedure (20). Fifty ng of 
protease or control plasmid are used for each transformation. Ptasmids pPDB16 and pMae2 are as 
previously described; plasmid pP48Mae6 encodes HIV protease with a point mutation at amino acid 29. 
Plasmids pPoll and pPo!2 encode the polio polymerase (19). 

Whereas several plasmids as controls transform strains 8L21(DE3) and BL21(DE3)pLysS equally well, 
plasmids pPDB16 and pMae2 encoding HIV protease do not yield transformants in BL21(DE3). These data 
strongly suggest that expression from pPDB16 or pMae2 is toxic to E. coll BL21(DE3). This toxicity, in turn, 
suggests a method of identifying agents which inhibit HIV protease. 

The apparent toxicity of HIV protease could be due to its inherent enzymatic activity, or to some other 
effect of this protein. This question is answered by isolating mutants of the HIV protease by screening for 
viable transformants of BL21(DE3). To introduce mutations into the HIV protease, plasmid pPDB16 is first 
transformed into E. cofi strain LE30 mutO (Cold Spring Harbor Laboratory, Cold Spring Harbor, New York). 
This strain has alO^O 4 higher frequency of spontaneous mutation compared to wild-type E. coli (23). 
Plasmid is prepared from 50,000 pooled LE30 mutO transformants. and this collective plasmid is used for 
the transformation of BL21 (DE3). Approximately 150 colonies are obtained. These clones are examined for 
HIV protease production by labelling of celts with 35 S methionine in the presence of IPTG, followed by SDS- 
PAGE using the system described previously and autoradiography. As shown in Fig. 2B, these putative 
mutants do not produce the 10 kd protBin observed for the wild-type clone. Instead, they produce protein of 
either 18 kd (lanes M3, M4, M5), or protein smaller than 18 kd, but not 10 kd. (lanes M2, M6, M7). These 
proteins are only observed upon induction with IPTG, indicating that they are under the control of the T7 
promoter. In addition, when plasmid isolated from these putative mutants is added to a ONA-dependent 
transcription-translation system from E. coli , protein of the same size as those seen in labelled cells is 
observed only in the presence of T7 RNA polymerase. 

To determine if a specific genetic lesion could be correlated with the altered size of the inducible 
protein, the entire 500 bp HIV DNA insert of each putative mutant is sequenced. The first group of 18 BL21- 
(DE3) survivors is comprised of eight clones producing 18 kd protein (M3, M4, M5, M8, M9, M12, P2, and 
P5), seven clones producing protein smaller than 18 kd (M2, M6. M7, M10. M11, P49 and P74). and three 
clones which do not produce protein upon induction with IPTG (Ml, P3, and P26). 

The sequencing results are presented in Tables II and III. Table II presents nonsense mutants of HIV 
protease: 
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TABLE II 



Nonsense Mutants Of HIV Protease 


Clone 


Mutation 


P26 


CAG - UAQ at nt 60 


M7 


CAA — UAA at nt 225 


P48 


CGA - UQA at nt 228 


M10, M11 


3 insertion at nt 259. UGA at nt 293 


M2 


G deletion at nt 352, UAA at nt 373 


P74 


G insertion at nt 353. UGA at nt 383 


M6. P52 


CAG — UAG at nt 387 



75 

Table II. The entire 500 bp protease insert of pPDB16 from BL21(DE3) survivors is subjected to ONA 
sequence analysis, and a premature stop codon is identified. Clone P26 encodes a truncated protease of 2 
kd which is not detected upon labelling of E. coli proteins, probably due to Its small size. Each other clone 
encodes a protein smaller than 18 kd whirls detected by labelling (see Fig. 2B, lanes M2, M6, M7). 

20 Each clone which produces protein smaller than 18 led contains a premature translation termination 
codon (Table II). The premature termination codons of clones M6, M7. and P49. result from a C -> U 
transition. For clones M2, M10. M11, and P74, the insertion or deletion of a G residue within a stretch of 5 
or 6 G's results in a translationaJ frameshrft which should lead to termination as indicated (Table II). In each _ 
case, whether a transition or frameshrft occurs, the size of the protease predicted from the DNA sequence 

25 is in excellent agreement with the size of the protein observed upon induction (Fig. 2B). 

Clone P28, which apparently does not produce labelled protein upon induction with IPTG, is shown to 
contain a premature translation termination codon at nucleotide 61 (Table II). The truncated protein 
expected upon induction of this construct would be approximately 2 kd. and Is too small to be resolved by 
the gel system used, which is based upon that of Plotch et a). (19). Clones M1 and P3 are also subjected to 

30 DNA sequence analysis, and are found to encode wild-type protease. Since whole plasmid is subjected to 
mutagenesis in strain LE30mutO, these clones may contain mutations outside of the HIV protease region 
that would prevent transcription or translation from genes under T7 promoter control, such as mutations in 
the promoter region itself or in the ribosome binding site (21). 

For each clone which produces 18 kd protein, a single nucleotide mutation leading to a single amino 

35 acid substitution is detected by DNA sequencing (Table III). After these initial results with the first group of 
18 clones are obtained, only those clones which produce 18 kd protein upon induction are examined in 
detail. The specific amino acid change determined for each mutant is summarized in Table III: 
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Migsense M utants Of HTV Protease 



Clone Position Mutation 





Nucleo- 
tide 


amino 
acid 








P57 


196 


-4 


UCC - 


— > 


uuc 








Ser 




Phe 


P46 


211 


2 


CAG ■ 


— > 


CGG 








Gin 




Arg 


P12, P37 


214 


3 


AUC • 


— > 


AAC 








He 




Asn 


P32 


250 


15 


AUA 


— > 


ACA 








He 




Thr 


P29 


270 


22 


GCU 


— > 


ACU 








Ala 




Thr 


M12 


285 


27 


GGA 


— > 


AGA 








Gly 




Arg 


P48 


292 


29 


GAU 


— > 


GGU 
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ASp 




Gly 




P15 


297 


31 


ACA 


— > 


GCA 


5 








inr 




Ala 




PI, P5, P80 


319 

^to 


38 






C 


10 


P66 


339 


45 


AAA 


> 


GAA 








l*y8 




GlU 




M3. M5 


34fi 


A1 
Q / 


AUA 

lie 


> 


ACA 

Thr 


IS 


P31, P44 


3SS 


so 


auu 

lie 


> 


ACU 

Thr 




M9 


381 


59 


UAU 


> 


CAU 


20 








Tyr 




His 




P24 




OU 


GAU ■ 
Asp 


— > 


GUU 
Val 




P13, P47, P68 


403 


66 


AUC 


> 


ACC 


25 








lie 




Mbe_ 

Thr 






424 


73 


GOT - 
Gly 


> 


GAU 

His 


30 


P45, P79 


427 


74 


ACA • 

Thr 


> 


ADA 
He 




M8 


433 


76 


DUA • 


> 


UCA 


35 


P2, P8 


436 


77 


jjeu 

VlUA 

Val 




ser 

GCA 
Ala 




P22 


442 


79 


ecu ■ 


> 


cuu 


40 








Pro 




Leu 




P38 


460 


85 


AUU • 
He 


> 


ACU 
Thr 


45 


P9 


463 


86 


GGA - 
Gly 


> 


GAA 
Glu 




P50 


465 


87 


AGA - 


— > 


GGA 


50 








Arg 




Gly 


P27 


484 


93 


AUU - 
He 


> 


ACU 
Thr 



55 Table III. Missense mutants of HIV protease. The entire 500 bp protease insert of pPDB16 from BL21(DE3) 
survivors producing 18 kd protease precursor (see text and Fig. 2B) is subjected to DNA sequence analysis, 
and a single point mutation is identified in each clone. Nucleotide and amino acid numbering are described 
in Rg. 1. 
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In all. 24 different point mutations are identified. Six of these mutations, at amino acids lie 3, lie 47. He 
50, Gly 73, Thr 74 and Val 77, are identified in two independent clones. The mutations at Leu 39 and He 66 
are each identified in three independent clones. Because mutations are reisolated without obtaining some 
other known mutations (e.g., mutation at Asp 25) (4,5), it is possible that sister ptasmids from the 

5 mutagenized plasmid pool are being detected. 

Some of these mutations can be grouped according to the proposed function of the amino acid within 
HIV protease. The triplet Asp, Thr, and Gly, is conserved throughout the family of aspartyl proteases (8) and 
helps to locate the active site of HIV protease at residues 25, 26, and 27. Mutations at or near these 
residues should render the protease enzymatically inactive by disruption of the catalytic site. The three- 

io dimensional structure of HIV protease has been recently solved (10), and it appears that applicants' 
mutations at Ala 22, Gly 27, Asp 29, Thr 31 , lie 85. Gly 86. and Arg 87, are within the cleft of the active site. 
Applicants also find mutations in amino acids which are proposed to participate in hydrogen bonding within 
the protease, including Ala 22, Lys 45, lie 47, He 66, Gly 73, Vai 77, and lie 85. Finally, the HIV protease is 
probably a dimer based on X-ray crystal structure (10,24), and applicants' mutations at Gly 27, He 50, and 

rs He 93 should interfere with dimer formation. 

Loeb et at. (25) performed extensive site-directed mutagenesis between amino acids 21 and 46 of the 
HIV protease, and then determined processing activity. Most missense mutations abolished processing 
within the region, which is highly conserved among retroviral proteases. These mutations include Asp 29 -> 
Gly, and Thr 31 -> Ala, in agreement with applicants' results. However, in contrast to the approach of Loeb 

20 et al., applicants' method of mutant isolation appears to select only for non-functional protease. Consistent 
with this are 31 mutations in amino acids 21-46 which Loeb et al. determined had little or no effect on 
processing. None of these mutations comes through applicants' screen. Applicants' results also indicate that 
point mutations throughout the protease, outside of the conserved regions, can destroy the activity of the 
enzyme (Table III). 

25 Kraussflch et al. (26) have shown that deletion of the carboxy-terminal 17 amino acids from the 18 kd 
HIV protease precursor prevents autodigestion which would liberate the mature 10 kd protease and 
furthermore, that the precursor is unable to cleave a gag substrate in trans . Similar results were obtained 
by Hansen et al. (27). Four of applicants' point mutations, at lie 85, Gly 86, Arg 87, and He 93, are within 
these 17 carboxy-terminai amino acids. 

30 Applicants' results indicate that the mutant 18kd protease precursor is incapable of autodigestion, and is 
apparently not toxic to BL21(DE3). Without being bound by the ensuing discussion, applicants propose that 
the 10 kd mature protease exerts its toxic effect by digesting an unidentified essential E. coll protein, and 
that the 18 kd precursor is unable to carry out this digestion. Failure to generate the maturelO kd protease 
could result from mutating an amino acid required for catalytic activity, or from disruption of the scissile 

35 Phe-Pro bond (located at amino acids -1 to +1 of the mature protease), such that autodigestion of the 18 
kd precursor could not occur. Applicants propose that the mutations at Ser -4, Gin 2, and He 3 (Table II), 
render the Phe-Pro bond uncleavable. Loeb et al. (25) used site-directed mutagenesis of the pol polyp rotein 
(which encodes the protease, reverse transcriptase, and endonuclease) to introduce substitutions at the 
upstream Phe-Pro bond. One class of mutants is defective at autodigestion, but can still cleave at the non- 
40 mutated reverse transcriptase and endonuclease borders. This processing results either from a small 
amount of undetected mature protease, or from the precursor which retained activity. Another class of 
mutants which fails to liberate the mature protease is also inefficient at cleavage to generate the reverse 
transcriptase and endonuclease. It is possible that applicants' method of mutant detection, the requirement 
for stable transformation of BL21(DE3), selects for 18 kd precursors which are enzymatically inactive. 

45 To directly examine the effect of mature protease on the viability of BL21(DE3), applicants construct 
plasmids which encode only the 10 kd protease, bypassing the need for autodigestion of precursor (Fig. 1). 
As shown in Table IB, the wild-type HIV 10 kd protease plasmid (pMae2) does not transform strain BL21- 
(DE3) successfully, but is able to transform BL21 (DE3)pLysS. Plasmid p48Mae6. encoding protease with a 
point mutation at amino acid 29 (Table HI), is able to transform both E. coli strains. 

so Further evidence that the HIV protease is toxic to BL21(DE3) is~provided by an examination of the 
survival of cultures in the presence of IPTG. In this experiment, cultures of BL21(DE3)pLysS cells harboring 
wild-type protease (pPDBl6) or mutant protease (pP48) are grown to mid-log phase, and protease synthesis 
is induced with IPTG. The number of viable ceils is determined by plating aliquots of cultures for colony- 
forming units. After 30 minutes of exposure to 1.0 mM or 0.1 mM IPTG, cells harboring wild type protease 

55 experience approximately four togs of killing compared to untreated cells (Fig. 3). In contrast, cells 
harboring the mutant protease are unaffected by the addition of IPTG. These data indicate that HIV protease 
is extremely toxic to BL21(DE3). Similarly, cells harboring plasmid encoding wild-type 10 kd protease 
(pMae2) are significantly more sensitive to IPTG than cells harboring plasmid encoding the corresponding 
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9 



5 



» 



25 



30 



36 



40 



46 



mutant protease (p48Mae6). . . . . . 

An E strain BL2KDE3) harboring plasmld pP48 has been deposited with the American Type 

Culture TJolliction and has been assigned ATCC accession number 68004 This f"* 1 **^"? 
may be subjected to one of the following methods to delete pP48 so as to leave E. cob strain BL21(DE3) 

<28 lTcuring with agents which preferentially inhibit plasmkJ replication, such as acridine orange, acnflavin, 
quinacrine. or ettidium bromide: For example. BL21(DE3)/pP48 is grown in Wc^»^&L The 
medSTs log peptone and I0g yeast extract per liter, adjusted to pH 7.6 with NaOH. Then 10* ceHs 
oTml are inflated into a series of tubes containing 10-100 ugAnl (10 ugftnl increments) of acndine 
orange. These cultures are incubated overnight at 37* in the dark. Cells are streaked onto LB p ates 
from the culture containing the highest concentration of acridine orange which showed growth. irrfrviduai 
colonies are tested for kanamycin resistance. Colonies which are kanamycin sensitive should have lost 
pP4& so that only E. coll strain BL21(DE3) remains. 

2) Spontaneous curi"ng7?or example, BL21(DE3)/pP48 is grown to stationary phase several tunes m LB 
bracking kanamycin. Cells are streaked onto LB plates. Individual colonies are tested for kanarnycm 
resistance. Colonies which are kanamycin sensitive should have lost pP48. so that only E. con strain 

BL21(DE3) remains. . . 

3) Curing with agents which inhibit DMA gyrase, such as coumermycin or novobtoan: This method Is 
identical to acridine curing, except curing agents are used at 1 to 7 micrograms per ml 

4) Curing with an agent which inhibits RNA polymerase, such as rifampicm: This method is identical to 
acridine curing, except rifamptcin is used at 5 to 10 micrograms per ml. 

The amount of protein produced upon IPTG induction by cells harboring plasmids encoding wild-type or 
mutX^ons of the IBM or 10 kd protease is examined by -8 methionine ^n^and-m- 
munoprecipitation. followed by gel electrophoresis and autoradiography (Fig. 4). ttoesttnatt ^ 
pTanTp4aMaee. which encode the 18kd precursor and the 10 kd protease respective^ ( anes 2 and 4)^ 
produce as much as ten-fold more protease than the corresponding wild type clones PDB16 (lane 1) and 
Maa2 (lane 3) The decreased amount of the wild-type protease produced compared to the mutant is 
probably due to the Wiling of cells by wild-type protease upon induction with IPTG (Fig 3) Unexpected^, 
however, the mutants still produce very low levels of the protease or precursor, despite their lack o toxicity. 
The mutant proteins cannot be detected by Coomassie blue staining of the corresponding geto. Since 
different genes are expressed at different efficiencies in the T7 system (21). there may be a bamer to 
overproducing HIV protease, even when its toxicity is apparently alleviated. 

The HIV protease has been successfully expressed in a variety of E. coll K12 strains (5.6.7, *27). 
although at least In HB101. Inhibition of growth upon induction of protease synttiesishas been ireported 
So) The HIV protease may be especially toxic to BL2KDE3). which is an E. coll B strain. The toxictty of 
HiV protease to E. coll BL21(DE3)21 appears to indicate that the protease is functioning WJ*- £ 
contrast to the resultsle-ported in this application. Krausslteh et al. (15) report expression of HIV Proteased 
strain BL2KDE3) lacking the lysozyme plasmld. Differences between the Krausslteh et al. protean 
coquet and that of the present invention may account for the consistent observation** BL2 1(DE3 
Zbe transformed by P PDB16. The amino terminus of applicants' 18 kd protease is ^ l^of theg 
g^e preceded by a single methionine residue, whereas that of KraussHch et al. is codon 5 of tojrtgene 
Sc^dedby 12 amino adds from phage T7 gene 10 (26). In addition, the novel protease construct of this 
VpLnid encoding'reSstance to kanamycin. not ampicillin. However the Protease Js^so 
toxic to BL2KDE3) when introduced on a vector specifying ampicillin resistance, .nd.cat.ng that the putative 
target of the HIV protease is probably not the phosphotransferase which confers kanamyan 

Recent studies with peptide substrates have identified minimal sequence requirements for cleavage by 
the H^pltai r^tew as seven amino acids are sufficient (11.12.16). Thus, the bacteria^ ^rote.n 
which Is ft* putative target of the protease need bear little homology to the natural gag and pol substrates. 
HTmX of^ HrTprotease to E. col. strain BL21(DE3) provides a method to isolate mutants of this 
IZ^m enzyme.'and. more Importantly, the potential to screen for protease inhibitors useful for 

*Tp^* hte^monstrated that HPV protease plasmids P PDB16 and P Mae2 are unable ,» ^success- 
fullyTansform E. coll strain BL21(DE3) (Table I). As a screen, these transformations are attempted .n *e 
onL n ce of potenSaTHIV protease inhibitors. Successful transformations serve to .dentrfy Inhibitors of the 
, JZS t Ser experiment, applicants ateo demonstoate that phage encoding Hrv protease is unable to 
transfect E. coll strain BL21(DE3)F+. These transfections may also be attempted ,n the presence of 

P0 CEtS£? ofTe first method for screening invohves the use of BL21(DE3)pLysS transformed 
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with HIV protease plasmids pPDB16 or pMae2. The induction of protease synthesis by IPTG kills these 

cells, as demonstrated in Figure 4. Potential protease inhibitors might prevent the death of induced cells. 
The foregoing experiments with the first HIV protease screening method may be adapted to identify 

inhibitors of polio protease. The starting material for this embodiment of the invention is poliovirus type 1 
s (Mahoney strain) which has been deposited previously with the American Type Culture Collection and has 

been assigned ATCC accession number ATCC VR-59. This poliovirus strain is used to generate the polio 

protease 3C through a series of steps described below. 

In summary, poliovirus strain 1 Is first used to generate the plasmid pT7pv1-5 (31). The piasmid 

contains the entire cDNA sequence for poliovirus strain 1 under the control of the T7 promoter described 
io previously. Polio protease 3C is then subcloned from this plasmid. 

Specifically, in a first step, cONA is prepared from poliovirus strain 1 and is subcloned into the T7 

vector to construct piasmid pT7pv1-5 by standard techniques. The polio protease gene (nucleotides 5438- 

5986) is excised from plasmid pT7pv1-5 as an Asu! restriction fragment (nucleotides 5435-5953). Synthetic 

oligonucleotides are used to recreate the upstream and downstream coding sequences, and to provide 
75 translation^ start and stop codons, as well as the Ndel and BamHI restriction sites previously described 

with regard to Figure 1. Following this procedure yields a plasmid bearing polio protease 3C under the 

control of the T7 promoter. 

In experiments analogous to those with HIV protease, applicants* data suggest that polio protease 3C is 
also toxic to E. coli strain BL21(DE3). As shown in Table IV below, the piasmid bearing polio protease 3C is 
20 unable to transform BL21(DE3) successfully, but does transform BL21(OE3)pLysS. The control plasmid 
pPol! (encoding polio polymerase as described in Table I) transforms both strains with approximately equal 
efficiency: 

TABLE IV 



No. of transformants 


Plasmid 


BL21(DE3) 


BL21(DE3)pLysS 


pPollo Protease 3C 

pPon 


0 
300 


150 
500 



These data indicate that polio protease 3C. like HIV protease, is toxic to this E. coli strain, BL21(DE3). when 
35 placed under the control of the T7 promoter. Therefore, a screening method for polio protease inhibitors can 

readily be devised. In such a screen, a compound to be tested is shown to be an inhibitor of polio protease 

by preventing killing of the bacterial strain BL21(DE3) and permitting successful transformations of that 

strain by a plasmid coding for the polio protease 3C. 

In turn, the results with HIV protease and polio protease 3C provide the basis for a generic screen for 
40 protease inhibitors. Of particular interest are compounds which inhibit other proteases which have been 

implicated in human diseases, such as human renin, trypsin and elastase. 

The particular target protease is placed in a plasmid under the control of the T7 promoter. The system 

is next shown to be toxic to E coli strain BL21(DE3), by failing to transform that strain even as strain BL21- 

(DE3)pLysS is transformed. Potential protease inhibitor compounds are then screened. Compounds which 
45 have inhibitory activity will be those which block the toxicity of the protease to BL21(DE3) and thereby 

permit transformations of that strain. 

The second screening method for inhibitors of protease enzymes is directed to the construction of 
fusion proteins comprising a reporter protein (meaning an easily assayed protein) into which a protease 
cleavage site has been introduced. The cleavage site corresponds to a portion of a natural substrate of the 

50 protease of interest . 

The reporter protein chosen is capable of being assayed for a change in color, light or activity when 
combined with a protease enzyme. The introduction of the cleavage site to create a protease "cleavage 
cassette" does not destroy the activity of the reporter protein. Thus, the fusion protein can be used in an 
assay system to screen for inhibitors of proteases. 

When the protease enzyme is added to the fusion protein containing the protease cleavage site, the 
protease cleaves at the protease site, thereby inactivating the reporter protein. This loss of activity is 
monitored by means of a color change, light change or by directly monitoring the loss of activity. A 
compound to be screened is included in the assay system and is identified as an inhibitor of the protease if 
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the reporter protein is not cleaved and remains active when monitored by any of the methods described 
above. 

Examples of reporter proteins that are utilized in constructing the fusion proteins and which are assayed 
by monitoring a color change with a color indicator compound include 0-gatactosidase, alkaline 
phosphatase. ^-glucuronidase and acetyltransferase. This system may be adapted for proteins specifying 
antibiotic resistance, such as lactamase. In a preferred embodiment of this invention, 0-galactosidase is 

used as the reporter protein. 

The 0-galactosidase enzyme encoded by the lacZ gene of E. coli is responsible for the conversion of 
lactose into glucose and galactose, allowing utilization of lactose as a carbon source (32). Because 0- 
galactosidase will also catalyze the hydrolysis of several other substrates which are converted into colored 
compounds. 0-galactosidase has proven an invaluable tool to monitor a wide range of biological processes. 
For example. 0-galactosidase fusions have been used to study transcriptional and translational regulation 
(33, and references therein). Fusion of heterologous protein to 0-gaIactosidase can facilitate the purification 
of that protein (34-37). In addition, interruption of the amino terminus of 0-galactosidase is the basis of 
identifying desired clones when using the pUC plasmid series (38). 

The demonstration in this invention that foreign protein sequences can be inserted into specific regions 
of jS-galactosidase. resulting in both the retention of 0-galactosidase activity and the accessibility of the 
foreign sequences to proteases, is novel. In the specific case of the protease cleavage cassette, the fact 
that 0-galactosidase can now be cleaved, and inactivated, by the cognate protease provides a simple 
means to monitor protease activity, and to search for inhibitors of these important enzymes. 

Examples of suitable color indicator compounds include X-gal and Fast Blue RR. A protein such as 0- 
galactosidase will react with X-gal to produce a blue colon when the protein is inactivated, no color change 
(white) is observed. A similar reaction of the protein with Fast Blue RR produces a red colon upon 
inactivation, no color change (white) is observed. 

Alternatively, the protein is chosen such that it catalyzes the fight-emitting chemical reaction of a protein 
derived from a btoluminescent or chemiiuminescent organism. However, when the protein is cleaved by a 
protease enzyme, the reaction cannot occur, such that no fight is emitted. The light may be observed with 
the naked eye. measured spectrophotometries or recorded on X-ray film. In a preferred embodiment the 
reporter protein used is luciferase. 

Additionally, the inactivation by the protease may be monitored by an in vitro system which directly 
measures the activity of the protein in a reaction with a substrate. 

In ail of the variants of the second embodiment of this invention, the fusion protein is expressed in a 
suitable expression system. The host is a bacterium, yeast insect cell, mammalian cell line, or any other 
suitable host expression system. In a preferred embodiment, the host is a strain of E. coli . 

As illustrative of the second screening method of this invention, the construction and use of an assay 
system for screening for inhibitors of HIV protease with j9-gaiactosidase and a color indicator will now be 

described. , . . . . _. 4< 

The initial step is to construct a plasmid containing the gene coding for 0-galactosidase. Plasmid pZMi 
is constructed as follows: E coli strain MC1081 (ara0139. A(arajeu)7697, A!acX74, gaiU. galK, hsdR. strA 
is obtained from Pharmacia. "Piasmid pZM1. containing the E. coli 0-galactosidase gene, is denved from 
pCH110 (Pharmacia). Plasmid pCHH0 contains an EcoRI site which is deleted by replacing the smaller 
Pstl/BamHI fragment from pCHIlO containing the jS-lactamase gene with the corresponding Pstl/BamHI 
fragment from a variant of pBR322 which lacks the EcoRI site normally present on that fragment. Next, the 
EcoRI/BamHI fragment containing the carboxy terminus of the 0-galactosidase gene is replaced with the 
oligonucleotide: 

51 AATTOCMCIGAGOGOOQCTO^ 3 f 

3 i QrjroGaciOGcrxccz^ 5' 



which recreates the carboxy terminus of *-galactosidase and places two stop codons at the end of the 
gene. This resulting plasmid is designated pZM1. 

The next step is to construct plasmids which encode wild type and mutant HIV protease, respectively, 
each under tac promoter control. These plasmids. designated p1+IQ and pl5-IQ. are constructed as 
follows: The protease genes from pMae2 (wild type) or p48Maee (mutant), as described previously, are 
excised as Ndel/BamHI fragments, made blunt by Klenow enzyme, and ligated into the Sma site of 
pKK223-3 (Pharmacia), creating p1 + and p15-. The Sphl/AlwNI fragment from p1 + and P 15-. containing 
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the HIV protease gene, is then ligated to the Pvul/AlwNI fragment containing the laclQ gene from pGEX2T 
(Pharmacia), creating pi +IQ and pt5-IQ (Fig. 5A). To delete the BamHI restriction site in the polylinker of 
p1 +IQ and pl5-IQ, these plasmids are partially digested with BamHI, made blunt with Klenow enzyme, and 
ligated. The resultant plasmids, designated pi +IQB and p15-IQB, contain a unique BaMHI site directly 

5 upstream of the tac promoter. 

Thereafter, an HIV protease cleavage site is inserted into jS-galactosidase by inserting an 
oligonucleotide into the 0-galactosidase gene of plasmid pZM-1. Studies from several laboratories using 
synthetic peptides which span the different cleavage sites of the gag and gag/pol polyproteins, have 
determined the mlnimai amino add sequence requirements for cleavage by HIV protease (11.12.15.16). The 

10 p8/PR site is cleaved most efficiently in vitro , and as few as seven amino acids are sufficient for cleavage 
(15). Based on these minimal requirements, a series of oligonucleotides ("cleavage cassettes") are 
synthesized. These oligonucleotides correspond to the ten amino acids which span the pB/PR junction, and 
contain the appropriate cohesive ends for insertion, in frame, into six different locations of the 0- 
galactosidase gene of plasmid pZM1 (Rg. 5B). 

rs The decapeptide Val-Ser-Phe-AsThPhe-Pro-Gln-lle-Thr-Leu corresponds to the p6/PR cleavage site of 
the HIV gag/pol polyprotein. Six pairs of oligonucleotides ("cleavage cassettes") encoding this decapeptide 
are synthesized. The oligonucleotides have in common the DNA sequence shown below containing a Hindlll 
restriction site (underlined), which encodes the decapeptide in each oligonucleotide pair 

20 vial Sar R» Asn Ebe Vm On lie lhr Leu 

5' GQLfiGCJXT AAC TTC OCX GAG NFC ACT CTO 3 f 
3* GAT TOG AAA TTO AAG GGk CTC TAG TEA GAC 5 V 

Each pair contains additional nucleotides specifying different cohesive ends designed for insertion, in 
the proper reading frame, into the unique BssH2, Cial, Drain, EcoRI, Saul or Sstt restriction sites of the E. 
coG /3-galactosidase gene of pZM1 (Rg. 5B). The six different sites in ^-galactosidase are selected because 
they are unique restriction sites within pZM1, and the cleavage cassette can be inserted by a simplified 
cloning procedure described below. The complete DNA sequences of the six pairs of oligonucleotides are 
as follows (with the HIV cleavage site set off by brackets): 



35 
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ECO 1/2 (xRI cohesive ends) : 

5 1 AA HT f OTA AGC TT T AAC TIC OCT CM ATC ACT CTG] C 3 1 
3 i A [c^x TOG AAA TIG AA6 OGA GTC TAG TGA GAC] GIT AA 5 V 

SST1/2 (xSstl cohesive ends) : 

51 g f GI\ AGC TT T AAC TTC OCT CM ATC ACT CTO] CM CP 3 f 

31 tc GAC [CAT TOG AAA TIG AM GGA GDC TAG TGA GAC] G S f 

CLA1/2 (xClal cohesive ends) 

5« C fCTA AGC TT T AAC TIC OCT CM ATC ACT CT ] 3' 

3* [ AT TOG AAA TIG AM GGA GIC TAG TGA GAG] C 5' 

DRA1/2 (xDrallX cohesive ends) 

5< G [ em AGC TT T AAC TTC OCT OG ACT AOS CT ] 3 9 

3 1 C GAC [GAT TOG AAA TIG AM GGA CTC TAA TG ] 5 1 

SAU1/2 (xMstllSaul cohesive ends) 

5' T GAA [G TA AGC TT T AAC TIC OCT CM ATT AOG CIG] AC 3* 
3« T [CAT TOG AAA TIG AM GGA GIC TAA TGC GAC] TGA CT 5 1 

BSS1/2 (xBssH2 cohesive ends) 

5' OG OGA [ GTA AGC TT T AAC TIC OCT CM ATT ACG CIG] A 3* 
31 T [CAT TOG AAA TIG AM GGA CTO TAA TGC GAC] TGC GC 5 1 



To simplify the cloning procedures only the cohesive ends of each oligonucleotide pair resemble the 
target restriction site, so that successful ligation of the oligonucleotide into the /3-galactosidase gene 
destroys that restriction site. Therefore, after ligation of the oligonucleotide into pZMi . the ligation mixture is 
digested with the target restriction enzyme to linearize any residual parent plasmid pZM1. After ethanol 
precipitation, the digested ligation mix is used for transformation of E. coli strain MC1061. Correct clones 
are identified by the presence of a new Hindlll site. This method results in a high proportion of correct 
clones and eliminates the need for phosphatase treatment of the vector and kinasing of the 

ohgo^uclwtid^ ^ designated pZM-Bss. pZM-CIa A. pZM-Dra. pZM-Eco. pZM-Sau. and pZM- 

Sst respectively. Also constructed is plasmid pZM-CIa B. which contains the cleavage cassette .n the 
reverse orientation encoding the unrelated decapeptide Qlu-Ser-Asp-Leu-Arg-Glu-Val-Lys-Ala-Tyr. which 
should not be recognized by HIV protease. In each case, insertion of the cleavage cassette within the B- 
gaiactosidase gene is confirmed by DNA sequencing. 
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To ascertain whether the desired £-galactosidase activity is retained despite the insertion of a cleavage 
cassette, E. coli strain MC 1061 harboring each construct is plated onto X-gal plates. In this assay, 
functional "J-galactosidase results in the formation of blue colonies, but lack of jS-galactosidase activity 
produces white colonies. Using procedures described below, each construct is also tested for quantitative 
5 0-galactosidase activity and for whether it is cleaved by HIV protease in vitro and in vivo . Table V sets 
forth the results of this assay: 

TABLE V 



10 


Beta-Gaiactosidase Activity of Cleavage Cassette Constructs, and 






Susceptibility to Cleavage by HIV Protease 






Plasmid 


Colony color on 


beta-gaJactosidase 


Cleavage by HIV 






X-gal plates 


activity (units) 1 


protease 2 


IS 








in vitro 


in vivo 




pZM-Bss 


Blue 


5 


no 


no 




pZM-CIa A 


Blue 


10 


no 


no 




pZM-Cla B 


Blue 


163 


no 


no 


20 


pZM-Dra 


White 


0 


no 


n.d. 3 




pZM-Eco 


Blue 


2 


no 


no 




pZM-Sau 


Blue 


63 


yes 


yes 




pZM-Sst 


White 


0 


no 


n.d. 




pZM1 


Blue 


1179 


no 


n.d. 


25 


none 


White 


0 


• 





1 beta-galactosidase activity is determined by hydrolysis of ONPG 

2 cleavage of beta-gaiactostdase by HIV protease is determined by Western analysis 
(see Figures 6 and 8) 

3 n.d. = not done 



Insertion of the cleavage cassette at the Saul, Ctal. BssH2 or EcoRI sites results in blue colonies, 
indicating that the enzymatic activity of jS-galactosidase is retained, insertion of the cleavage cassette in the 

35 opposite orientation in the Clal site such that it encodes a decapeptide unrelated to the HIV protease 
cleavage site (Table V, pZM-CIa B) also results In theformation of blue colonies. These data indicate that 
the 0-galactosidase protein can tolerate insertions at several locations and still retain detectable enzymatic 
activity. Conversely, insertion at either Drain or Sstl destroys 0-galactosidase activity, since colonies 
harboring these constructs are white. 

40 The intensity of blue color in strains harboring the pZM plasmids Bss. Cla A, Cla B. Eco and Sau 
exhibits considerable variation, and in each case is less intense than that conferred by the parent plasmid 
pZM1. 0-gaJactosidase activity is quantitated by hydrolysis of O-nitrophenyl-^-D-galactoside (ONPG) (39). 
Plates contain 5^romo-4^loro-3^ndoiyl-^-r>galactoside (X-gal) at 40 ug/ml. The assays use a modifica- 
tion of those described by Miller (40). Ceils are grown m Luria broth to an absorbance of 0.5 (600 

45 nanometers). Cells (0.5 ml) are pelleted and resuspended in 2 ml Z buffer (0.1 M Na3PO*. pH 7.0, 1 mM 
MgSO*. 02 mM MnSO*. 0.1 M 0-mercaptoethanol, 0.2 mg/ml CTAB (hexadecyitrimethylammonium 
bromide), 0.1 mg/ml sodium deoxycholate), and incubated for five minutes at 28* C. Next, 0.6 ml of 4 mg/ml 
ONPG in 0.1 M NaaPO*. pH 7.0 are added, and incubated at 28 'C until the solution turns yellow (10 
minutes to overnight). To stop the reaction, 1.3 ml of 1 M Na 2 C0 3 are added. The absorbance is then read 

so at 420 and 550 nanometers. The units of 0-gaiactosidase are calcuated using the following equation: 

units - (A 42Q^ 1 ' 75) X A S5Q 
55 600 

where A = absorbance. t = time of reaction in minutes, and v = volume of culture used in ml. 
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The results of the quantitative assay are also shown in Table V. The blue clones exhibit significantly less fi- 
galactosidase activity than the parent plasmid pZM1. ranging from about 10 fold less (pZM-CIa B) to about 
500 fold less (pZM-Eco). Without being bound by the following, it is proposed that the reduced &- 
galactosidase activity observed upon insertion of the cleavage cassette may result from a change in the 
structure of this protein which attenuates its enzymatic function, or from 0-gaJactosidase/cleavage cassette 
protein which displays wild-type enzymatic activity, but is present in reduced amounts, or from a 
combination of these two factors. As expected, white clones pZM-Dra and pZM-Sst do not produce 
detectable levels of /s-galactosidase activity in this assay. 

The 0-gaJactosidase/cleavage cassette proteins are next tested for susceptibility to cleavage by HIV 
protease in vitro . Two samples of E coli strain MC1061. one harboring protease plasmid pi +10 and the 
other plasTnid-piWQ. <"> grown to 0.5 absorbance units (595 nanometers). Protease synthesis is induced 
by addition of IPTG to 1 mM. After one hour, cells are pelleted, resuspended in 0.66 volumes M buffer (50 
mM 2-(N-morpholino)ethanesulfonic acid (MES). pH 6. 1 mM dithiothreitol. and 25 mM NaCI) containing 1 
mM EDTA and 1 mg/ml lysozyme, and incubated on ice for 10 minutes. Cells are pelleted and resuspended 
in 0 01 volume M buffer, and lysed by three freeze/thaw cycles (freezing rapidly in dry ice/ethanol. and 
thawing slowly in ice water). After the final thawing, the extract is centrlfuged and the dear supernatant is 

used as a source of protease. 

The same method is used to prepare extracts containing ^-galactosidase from E coH MC1081 cells 
harboring the pZM plasmid series described previously, except mat IPTQ is omitted and 0.1 volume of M 
buffer is used for the freeze/thaw cycle. For cleavage reactions. 20 ul of /S-galactosidase extract is typically 
incubated with 5 ul of protease extract at 37* for one hour, followed by electrophoresis of incubation 
reactions on SDS-polyacry (amide gels. /J-gatactosidase is visualized by Western blot analysis using house 
artM-galactosldase polyclonal serum (Sigma) and the ImmunoSelect kit (Bethesda Research Laboratories). 
"S-methionine-labelled control substrate corresponding to the HIV gag polyprotein (nt 109-1417) (4) is 
produced from the T7 promoter in E coli strain BL21(OE3)pLysS. 

An extract of E coli strain MC1081 containing wild type protease (from p1 +IQ). but not mutant 
protease (from pIsTajTis able to cleave a gag polyprotein which is a natural substrate for HIV protease 
(Rg 8A) Extracts of E coli harboring 0-galactosidase/deavage cassette constructs are incubated with 
either wild-type or mutartTiiv protease extract, and cleavage of galactosidase is monitored by Western 
analysis using anti- /S-galactosidase antibodies (Rg. 6B). Native ^-galactosidase encoded by pZM1 is 
unaffected by these Incubations (lanes 4-6). Of the six /j-galactosidase/cleavage cassette proteins exam- 
ined, five of these (Bss. Cla A. Dra. Eco and Sst) are likewise undigested by HIV protease (Table V). as is 
the Qa B clone containing a reversed decapeptlde insertion used as a control. 

In contrast the /J-galactosidase/cleavage cassette encoded by two independent pZM-Sau clones, is 
cleaved by wild type HIV protease (lanes 8 and 1 1). producing a cleavage product of approximately 100 kd. 
consistent with cleavage at the Saul insertion site located 80 amino acids from the amino terminus. The 
other expected cleavage product of approximately 8 kd is not detected and may be unstable, incubation of 
the pZM-Sau 0-galactosidase/cleavage cassette with mutant HIV protease (Fig. 6B. lanes 9 and 12) fails to 
produce cleavage, indicating that the cleavage event is mediated by HIV protease and not by an E coll 

Pr ° t To S8 further examine the basis for reduced /r-galactosldase activity in the ^alactosidase/cleavage 
cassette clones, the amount of galactosidase is quantitated by Western analysis, using anti-ZJ-ga ac- 
tosidase antibody. It is found that the pZM-Sau construct has significantly less /j-galactosidase protein 
compared to the parent pZM1 (Rg. 6B. compare the intensity of the /J-galactosidase band in lane 4 with 
lanes 7 and 10). This difference is quantitated more accurately by Western analysis of different dilutions of 
cells harboring pZM1 and pZM-Sau. and it is determined that the pZM-Sau clone has about 10-20 fold less 
galactosidase protein than the parent done pZM1 (data not shown). Thus, for pZM-Sau. a reduced 
amount of /J-galactosidase protein with approximately wild-type levels of enzymatic activity, could account 
for the reduction of /S-galactosidase activity observed on X-gal plates and in the ONPQ assay (Table V). 

Western analysis on the other cleavage cassette clones demonstrates similarly reduced levels of p- 
galactosidase protein suffident to explain their decrease in enzymatic activity. The non-functional /J- 
galactosidase proteins encoded by pZM-Dra and pZM-Sst are likewise present in reduced amounts. Since 
the native ^-galactosidase encoded by pZM1 and the altered /S-galactosidase/cleavage cassette proteins 
are expressed from identical vectors which differ only by the insertion of the cleavage cassette in , the g- 
galactosidase gene, it appears that the 0-galactosidase/cleavage cassette proteins are less stable than the 

"^n STer experiment pepstatjn A. a known inhibitor of HIV protease (34). is tested as to whether 
cleavage of the /j-galactosidase/cleavage cassette protein pZM-Sau incubated with HIV protease is inhibited 
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by this compound. Pepstatin A at 1 ug/ml completely inhibits the cleavage of the altered 0-galactosidase, 

further indicating that the cleavage is due to HIV protease (Figure 7). 

The results in vitro demonstrate that with the exception of the pZM-Sau construct, 0- 

galactosidase/cleavage cassette proteins are not cleaved by HIV protease. The cleavage cassette in the 
5 protease-resistant proteins might be buried in the interior of the proteins and so be inaccessible to the 

protease in an in vitro assay. For this reason, it is useful next to introduce HIV protease into the same cell 

as these /3-gaiactosidase/cleavage cassette proteins, to allow for co-transtational cleavage in vivo . 

To test for cleavage of the 0-gaJactosidase/cleavage cassette encoded by pZM-Sau in vivo , this altered 

0-galactosidase gene is subcloned into the plasmids p1+IQ and p15-IQ, which encode wild type and 
ro mutant HIV protease, respectively. Specifically, the altered 0-galactosidase gene from pZM-Sau is excised 

as a Stul-BamHI fragment and ligated into the BamHI site of pt + IQ and p15-IQ, creating plasmids 

p1 + IQSau and p15~IQSau, respectively. Samples of an E. coli strain MC1061 harboring plasmld p1 +IQSau 

have been deposited with the American Type Culture Collection and have been assigned ATCC accession 

number 68,352. 

is For comparison purposes, the 0-gaiactosldase/cleavage cassette regions of pZM-Bss, pZM-CIa A, and 
pZM-Eco are also subcloned into p1 + IQ. The altered 0-gaJactosidase produced by pZM-CIa B is used as a 
control. Since pZM-Dra and pZM-Sst produce non-functional 0-gaJactosidase, these will not be considered 
further. 

Total protein from cells harboring these resultant plasmids p1 +IQSau and p15-IQSau is subjected to 
20 SDS-PAGE, and cleavage of 0-galactosidase is monitored by Western analysis (Rg. 8). This 0- 
galactosidase/cleavage cassette is cleaved in vivo by wild type HIV protease (panel A, lane 1 , and panel B, 
lane 2), but not by mutant protease (panel ~B, lane 3), generating a cleavage product of the same size as 
that observed in vitro (Fig. 6B, lanes 8 and 11). It is estimated that approximately 50-75% of the 0— 
gaiactosidase/cieavage cassette protein is cleaved. These cleavage assays are done in liquid culture. The 
25 amount of cleavage which occurs in colonies is not directly quantitated, but E. coli MC1061 harboring 
p1 +IQSau are white on X-gaJ plates, whereas p1 5-IQSau confers blue color. These data are consistent with 
cleavage of the ^-galactosidase/cleavage cassette by wild type, but not by mutant HIV protease. 

HIV protease does not cleave any of these other ^-galactosidase/cleavage cassette proteins in vivo that 
are protease-resistant in vitro (Fig. 8A, lanes 2-5. and Table V). As expected, cells harboring p1 + IQBss, 
30 p1 + IQCIa A, p1 + IQCIa* B, or pi + IQEco are blue on X-gal plates, consistent with lack of cleavage by HIV 
protease. 

The results of another HIV protease in vivo cleavage assay are shown in the photograph of Figure 9. 
Samples of E coli strain MC1061 harboring various plasmids are spotted onto plates containing the color 
indicator X-gal. The plasmids are pZM-1 (a positive control for 0-galactosidase activity), pZM-2 (a negative 

35 control for 0-gaJactosidase activity), pZM-Sau, p1 + IQSau (labelled 1 + IQB-Sau) and p15-IQSau (labelled 
p15-IQB-Sau). In the cleavage assay with wild-type HIV protease, constructs which cleave /3-gaiactosidase 
produce a white color, while constructs which do not cleave 0-galactosidase produce a blue color. Plasmid 
pZM-2 encodes mutant, inactive 0-galactosidase, and so produces white colonies. Plasmid pZM-2 is 
constructed by digesting plasmid pZM-1 with EcoRI and BamHI to delete the DNA sequence coding for the 

40 18 amino acids at the carboxy-terminal end of 0-gaiactosidase. As expected. pZM-1 produces a blue color, 
as do pZMSau and p15-IQSau. Plasmid pZM-2, as a negative control, produces a white colon 0- 
gaiactosidase produced by p1 + IQSau is cleaved by the protease and produces a white color. 

The 0-galactosidase protein which has been modified by insertion of a viral protease cleavage cassette, 
can be used in a screen to identify inhibitors of that protease, or of any other protease whose cleavage 

45 sequence Is known and can be introduced into the Saul site of l-galactosidase. 

The plasmid p1 + IQSau in an appropriate E. coli strain, such as BAS849 (41) (which has a mutation in 
the outer membrane, rendering it more permeable to various compounds), is plated as a lawn onto an agar 
plate. Alternatively, a liquid culture may be used as a nutrient medium. The plate should contain sufficient 
IPTG to induce HIV protease synthesis, cleaving l-gaiactosidase and rendering the lawn colorless when 

so assayed with 0-gaJactosidase color indicators, such as X-gaJ (40), Fast Blue RR (42) or others. Possible 
inhibitors of the protease are spotted onto the lawn, which is then overlayed with the indicator. 

A protease inhibitor will inactivate the protease, preventing cleavage of jS-galactosidase. The 0- 
galactosidase then reacts with the indicator, causing the spot to become colored (blue in the case of X-GaJ, 
red in the case of Fast Blue RR)* Thus, protease inhibitors are identified. 

55 A similar screening method may be used where a light-emitting reaction is monitored, rather than a 
color change. The cleavage cassette is inserted into a protein such as lucif erase (Clontech Laboratories, 
Inc., Palo Alto, CA), rather than 0-galactosidase. In the absence of a protease enzyme, the 
luciferase/cleavage cassette protein catalyzes the light-emitting chemical reaction. In the presence of 



18 



EP 0 421 109 A2 



luciferin and ATP an enzyme-bound luciferyl-adenylate complex is formed, which is followed by oxidative 
S« reason products are CO,, oxylucifenn and 

The light emission may be observed with the naked eye. measured spectrophotometncally or recorded on 

^We™ " a protease enzyme is present, it cleaves at the protease enzyme cleavage site, thereby blocking 
the Son SSd by^uciferase. such that no light is emitted. In a screen for protease mh.btors^a test 
^rnp^d deluded In the assay system. If the compound prevents cleava^ by protease, the 
n3 proceeds and light is emitted, thereby identifying the compound as a protease .nh.bttor 

r^SZ above in both the color indicator and light-emitting reaction in rfyo assays tha e^ressjon 
0 syst^^S genes coding for the ^-ga.actos.dase/c.eavage cassette protein and H.V protease. The 

vitro^a' SSSS TyZ^Z 'only the gene coding for the ^acto^c^age cas^tte 
STtL HIvTrotease is generated separately and kept apart from the fuaon protem unUl ^ttmecrf 
At iL timTttie ourtfied 8-galactosidase/cleavage cassette protein is rmxed with HIV protease 

* Star appropriate site for insertion of heterologous protein sequences wrthout destroymg H^iac 

^Tsh^Tn Figure 10. this fusion protein is cleaved by polio 3C protease (lane 5). which >» 

in BmSoLysS in similar fashion as Nicklin et al. (44). but not by HIV protease (lane 6Ms expected. 

SSint used to screen for inhibitors of polio 3C protease, using me methods desenbed 

30 previously. 
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Claims 

1. A method for screening for compounds which inhibit protease enzymes, which comprises the steps of: 

(a) constructing a plasmid having a DNA sequence which encodes a protease under the control of a T7 
promoter; 

(b) selecting a bacterial expression strain which is not transformed by said plasmid because of the 
toxicity of the protease to the bactBrial strain; and 

(c) adding a compound to be screened together with the plasmid to the bacterial strain such that if the 
bacterial strain is not Wiled and produces transformants. the compound is identified as an inhibitor of the 
protease. 

2. The method of claim 1, wherein the bacterial strain is E coji BL21(DE3) and the plasmid has a DNA 
sequence which encodes human immunodeficiency virus protease. 

3. The method of claim 1, wherein the plasmid is pPolio protease 3C and the plasmid has a DNA sequence 
which encodes polio protease 3C. 

4. A method for screening for compounds which inhibit human immunodeficiency virus protease, which 
comprises the steps of: 

(a) constructing a plasmid having a DNA sequence which encodes human immunodeficiency virus 
protease under the control of a T7 promoter; 

(b) selecting a bacterial strain transformed by said plasmid. which strain is killed by the induction of 
protease synthesis by the inducing agent isopropyh^D-thiogaJactopyranside; and 

(c) adding a compound to be screened together with isopropyl-tf-D-thiogalactopyranoside to the trans- 
formed bacterial strain such that if the induced cells are not killed, the compound is identified as an 
inhibitor of the human immunodeficiency virus protease. 

5. The method of claim 4. wherein the bacterial strain is E. coli BL21(DE3)pLysS. 

6. A method of preventing the expression of enzymatically active 10 kd human immunodeficiency virus 
protease, which comprises: (a) introducing a mutation into the DNA sequence which encodes the 18 kd 
protease precursor of the 10 kd protease, thereby preventing autodigestion of the precursor which would 
otherwise liberate the mature, enzymatically active 10 kd protease: or (b) introducing a mutation into the 
DNA sequence which encodes the 10 kd protease. 

7. A method of producing non-functional 10 kd human immunodeficiency virus protease, which comprises 
modifying the DNA sequence which encodes the 18 kd protease precursor of the 10 kd protease, such that 
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the scissile Phe-Pro bond located at amino acids -1 to 1 of the mature 10 kd protease is disrupted* 

8. A method for screening for compounds which inhibit protease enzymes, which comprises the steps of: 

(I) 

(a) selecting a reporter protein which is capable of being assayed by reacting with a color indicator 
s compound to produce a color change; 

(b) inserting into the gene coding for said protein a heterologous DNA sequence coding for a peptide 
which includes a protease enzyme cleavage site; 

(c) inserting into a suitable expression system the construct of step (b) and a gene coding for a 
protease enzyme which recognizes and cleaves the protease enzyme cleavage site of the peptide 

w coded for by the heterologous DNA sequence of step (b); 

(d) preparing an assay system which comprises: 
(i) the expression system of step (c); 

(if) a color indicator compound; and 
(iii) a nutrient medium; and 

75 (e) adding a compound to be screened to the assay system, such that, if the compound inhibits the 
protease enzyme, the expression system produces an intact construct of step (b). the reporter protein 
retains its activity and color is maintained in the assay system, but if the compound is not an inhibitor, 
the protease cleaves the protease enzyme cleavage site of the peptide of step (b), the reporter protein 
is inactivated and no color is observed in the assay system; or 

20 (II) 

(a) selecting a reporter protein which is capable of catalyzing a light-emitting chemical reaction; 

(b) inserting into the gene coding for said reporter protein a heterologous DNA sequence coding for a 
peptide which includes a protease enzyme cleavage site; 

(c) inserting into a suitable expression system the construct of step (b) and a gene coding for a 
26 protease enzyme which recognizes and cleaves the protease enzyme cleavage site of the peptide 

coded for by the heterologous DNA sequence of step (b); 

(d) preparing an assay system which comprises: 

(i) the expression system of step (c); and 

(ii) a compound which undergoes a light-emitting chemical reaction catalyzed by the reporter 
30 protein of step (a): and 

(e) adding a compound to be screened to the assay system, such that, if the compound inhibits the 
protease enzyme, the expression system produces an intact construct of step (b), the reporter protein 
retains its catalytic activity and a light-emitting chemical reaction occurs, but if the compound is not 
an inhibitor, the protease cleaves the protease enzyme cleavage site of the peptide of step (b), the 

35 reporter protein is inactivated and the light-emitting chemical reaction does not occur. 

9. A method of claim 8. wherein in (I) the reporter protein of step (a) is l-galactosidase, in (II) the reporter 
protein of step (a) is luctferase; and in either (I) or (II), the protease cleavage site of step (b) is the human 
immunodeficiency virus pB/PR gag /poi polyprotein cleavage site, the expression system of step (c) 
includes an E . coli host strain, and the protease enzyme of step (d)(i) is the human immunodeficiency virus 

40 protease. ~ 

10. A method of claim 8, wherein in either (I) or (II) the protease cleavage site of step (b) is a polio 3C 
protease cleavage site and the protease enzyme of step (d)(i) is polio 3C protease. 

1 1 . A method for screening for compounds which inhibit protease enzymes, which comprises the steps of: 
(a) selecting a reporter protein whose activity is capable of being monitored by reacting with a substrate; 

45 (b) inserting into the gene coding for said reporter protein a heterologous DNA sequence coding for a 
peptide which includes a protease enzyme cleavage site; 

(c) inserting into a suitable expression system the construct of step (b); 

(d) preparing an assay system which comprises: 

(i) a protease enzyme which recognizes and cleaves the protease enzyme cleavage site of the 
so peptide coded for by the heterologous DNA sequence of step (b); 

(ii) the expression system of step (c) harboring the construct of step (b); and 

(iii) a substrate which reacts with the reporter protein of step (a); and 

(e) adding a compound to be screened to the assay system, such that if the compound inhibits the 
protease enzyme, the expression system produces an intact construct of step (b), and the reporter 

55 protein retains its activity as monitored by reaction with the substrate, but if the compound is not an 
inhibitor, the protease cleaves the protease enzyme cleavage site of the peptide of step (b), and the 
reporter protein is inactivated. 

12. A method of claim 11, wherein the protein of step (a) is 0-galactosidase, the protease cleavage site of 
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step (b) is the human immunodeficiency virus p6/PR gag /pol polyprotein cleavage site, the expression 
system of step (c) includes an E. coli host strain, the" protease enzyme of step (d)(i) is the human 
immunodeficiency virus protease, and the substrate of step (d)(iii) is O-nitrophenyhjS-D-galactoside. 

13. A method of claim 11, wherein the protease cleavage site of step (b) is a polio 3C protease cleavage 
5 site and the protease enzyme of step (d)(i) is polio 3C protease. 

14. A piasmid selected from the group consisting of plasmids designated pPDB16, pMae2, p1 +IQSau and 
pZM3CSau. 

15. A bacterial strain harboring a piasmid selected from the group consisting of E coli strain DH5a 
harboring piasmid pPDB16 (ATCC 68005), E coli strain BL21(DE3) harboring piasmid pP48 (ATCC 68004) 

io and E coli strain MC1061 harboring piasmid p1 +IQSau (ATCC 68,352). 
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SEQUENCE LISTING 
INFORMATION TOR SEQ ID KOt is 
SEQUENCE TYPE* Oligonucleotide 
SEQUENCE LENGTHS 60 nucleotide* 

FRAGMENT TYPE: Carboxy terminus of l-galactoaidaae plus 
two atop eodons 

AAT TCC AOC TGA GCG CCQ GTC GCT ACC ATT ACC AGT TOG TCI 42 
GGX OTC AAA AAT AAT AAG 60 

INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE TYPE: Oligonucleotide 
SEQUENCE LENGTH: 30 nucleotide* 

FRAGMENT TYPEs Internal coding region of the HIV gag/go; 
polyproteln 

FEATURES t Codea for a decapeptide corresponding to the 
p6/PR cleavage aita of the HIV qaq/pol polyprotein 

GTA AflC TTT AAC TTC CCT CAG ATC ACT CTG 30 

Val Ser Phe Aan Pha Pro Gin lie Thr Leu 

15 10 
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INFORMATION FOR SSQ ZD NOt 3 J 
SEQUENCE TYPE i Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotides 

FRAGMENT TYFEs Modified internal coding region of the HIV 
gftq/ttoi polyprotein 

MATCHES! Codes for a decapeptide corresponding to the 
p6/PR oleavage sits of ths HXV gaslSSl polyprotein, and 
contains a cohesivs snd for ths EcoRI rsstriction sits of 
ths 2« fiflii 0-galactoaidaae gshs of plaamid pZMl. 

AATTT OTA AfiC TTT AAC TTC CCT CAG ATC ACT CTG C 36 
INFORMATION FOR SEQ 10 NO: 41 

SEQUENCE TYPE! Oligonucleotide 

SEQUENCE LENGTH: 36 nucleotides 

FRAGMENT TYPE* Modified internal coding region of the HIV 
gup/pal polyprotein 

* 

FSATOXSSt Codaa for a dacapaptida corraaponding to tha 
p«/PR elaavaga aita of tha HIV S&aZfiSl polyprotain, and 
eontalna a cohaaiva and for tha SatI raatrietion aita of 
tha £. aoll j-galaotoaidaaa gana of plaamid pZMl. 
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DTTOWWTION POR SSQ ID Wt Ji 
SBQUEHCE TYPBt Oligonuclaotida 
SEQUENCE LENGTH: 3« nuclootida. 

ntASXBiT TtVZt Koditiad intarnal coding ragion of to. HIV 
7 «a/Bol polyprotain 

RATOTESi cod., for a doeapoptldo eorra.ponding to to. 
b«/P» olaavag. .if of too HIV Mfl/Ml polyprotoin, and 
contain, a coho.iv. and for to. EcoRl raatriction ait. of 
too a. flail i»-galaotoaldaaa gana of plaamid pail. 



Mm OTA AfiC TW AAC TTC CCT CAfl AXC ACT CTO C 
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IHPOJWATIOM POR SEQ ID NO: 41 

SEQUENCE WBt oilgonuclaotida 
SEQUENCE LENGTH: 36 nuelaotidaa 

PKXOKBMT TYPE. Kodifiad Into rnal coding ragion of to. HIV 

gaa^aal poiyprot.in 

maUMS, cod., for a docapaptida corraaponding to to. 
»«/FR elaavag. aita of toa HIV gAS/SSl polyprotain, and 
fon^inaTcL-iv. .nd for to. S.tl ra.tricti.n .it. of 
th. S. flaU tf-galacto.idaa. gan. of plaamid pZMi. 
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0 ©TA AOC TTT AAC TTC CCT CAQ ATC ACT CTG CACCT i 
HfFOBKATION FOR S2Q ID HO J 5: 

SSQOEHCS Tim Oligonucleotide 
SIQUZHCS LXKGTH: 30 nucleotides 

PRA6NSHT TYPE J Modified internal coding region of the 
HIV ya/Bol polyprotain 

nATORlSs Codas for a decapeptide corresponding to the 
p6/» oleavage alta of tha HIV yg'poi pelyprotain, and 
contain, a oohaaiva end for tha Clal raatriction aita of 
tha B» g Qli A-galactoaidaee gana of plaaaid pZMl. 

C OTA AGC TTT AAC TTC CCT CAG ATC ACS CT 



IKPOSMATXON FOR SSQ ID NOt «J 

SBQOSHca WPBt oligonucleotide 

SEQUENCE LENOTHl 30 nuclaotidaa 

nutSMSNT TYPE J Modified intamal coding region of tha HIV 
g.a/pol polyprotain 

FEATURES: Codas for a dacapaptida corresponding to tha 
„i.„™« Bit. of the HIV aaaZBfll polyprotein, and 
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contains a cohesive end for the Drain reetriotion aite of 
the £. ffoli J-galactoaidaae gene of plaemid pZMl. 

0 OTX AOC TTT AAC TTC CCT CAO ATT ACG CT 30 



IH90RMATXON FOR S1Q ID NO: 7: 
SEQUENCE WW i Oligonucleotide 
SEQUENCE LENGTH: 36 nucleotide* 

FRAGMENT Tmi Modified internal coding region of the KIV 
aaa/col polyprotain 

FEATURES l Codea for a dacapeptide corraaponding to tha 
p6/P* olaavage aite of tha HIV gaalfi&L polyprotain, and 
containa a eoheeive end for the Saul reatrietion aite of 
the £. goli 0-galaotoaidaae gene of plaeoid pfHl. 

TOAA OTA AOC TTT AAC TTC CCT CAO ATT ACO CTO AC 3« 



27 



EP 0 421 109 A2 



DTTORMXTIOH FOR SEQ ID KOt 8t 
SEQUENCE TYPBt Oligonucleotide 
SEQTJBHCB LENGTH: 3« nueleotidee 

FRAGMENT TYPBt Modified internal ceding region of the HIV 
gna/pol polyprotein 

FEATURES t Codee for a deeapeptide oorreeponding to the 
pa/PR cleavage site of the RTV ggq/pgl polyprotein, and 
eontaina a cohesive and for the SasHl restriction aite of 
the s> call j-galactoaidase gene of plasaid p2Hl. 

COCOa OTA AGC TTT AAC TTC CCT CAO ATT ACQ CTG A 36 



INFORMATION ?0R SEQ ZD KOt 9t 
SEQUENCE TYPBt Peptide 
SEQUENCE LENGTH: 10 amino acida 

FRAGMENT TYPBt Encoded by cleavage cassette in reverse 
orientation of plasmid pZM-ClaB 

fEATORESt Hot recognized by HIV protease 

olu Ber xep Arg ciu Val Lya Ala Tyr 

5 « 
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INFORMATION FOR SZQ ID NO J 10 

82QUZNCZ TYP1: Paptida 

8BQOSNCE LENGTH* 13 amino aoida 

FRAGMENT TYPE* Polio 3C protaaaa claavacr* sits 

Mat Oltt Ala Lau Pha Gin Oly Pro Lau Gin Tyr Lya Aap 

15 10 
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FIG. 2A FIG. 2B 
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FIG. 4 
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FIG. 5B 
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FIG. 7 
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FIG. 8A 
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FIG. 9 
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© Screening methods for protease inhibitors. 



© The invention provides methods to screen for 
inhibitors of proteases. The first method involves the 
construction of a plasm id system such that expres- 
sion of a protease is toxic to a bacterial strain. 
Compounds of interest may then be screened for 
those which inhibit the toxic protease and thereby 
prevent killing of the bacterial strain. Inhibition is 
detected by successful transformations of the bac- 
terial strain by the plasmid coding for the protease. 



The second method involves the construction of a 
fusion protein which contains a protease cleavage 
site inserted into a protein whose activity may be 
assayed. Compounds are screened for protease in- 
hibition, which is detected by assaying for a change 
in color, light or activity of the protein. HIV protease 
and polio 3C protease are examples of proteases 
that may be used in these methods. 
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